Frequency warping based on mapping formant parameters

Authors

  • Zhiwei Shuang
  • Raimo Bakis
  • Slava Shechtman
  • Dan Chazan
  • Yong Qin
Abstract

We propose a novel method of generating a frequency warping function by mapping formant parameters of the source speaker to those of the target speaker. An alignment and selection process is performed to ensure that the mapped formants represent the differences between the speakers well. This approach requires only a very small amount of training data to generate the warping function, which greatly eases its application. It also achieves high-quality converted speech while successfully converting the speaker's identity. A practical voice morphing system has been built on this approach, and experimental results show its effectiveness.
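
The abstract does not say how the mapped formants are turned into a warping function. As a minimal sketch, assuming the aligned formant pairs serve as anchor points joined by straight segments (piecewise-linear interpolation is an assumption here, not the authors' confirmed method), the idea could look like this in Python. The function name `make_warping_function`, its parameters, and the formant values in the usage note are hypothetical.

```python
from bisect import bisect_right


def make_warping_function(source_formants, target_formants, nyquist_hz):
    """Return a piecewise-linear map from source to target frequency (Hz).

    Assumption: each source formant is already paired with the target
    formant at the same index (the alignment and selection step from the
    abstract is not reproduced here).
    """
    if len(source_formants) != len(target_formants):
        raise ValueError("source and target formant lists must have equal length")

    # Keep the pairs together while ordering them by source frequency.
    pairs = sorted(zip(source_formants, target_formants))

    # Pin both ends so that 0 Hz maps to 0 Hz and Nyquist maps to Nyquist.
    xs = [0.0] + [float(s) for s, _ in pairs] + [float(nyquist_hz)]
    ys = [0.0] + [float(t) for _, t in pairs] + [float(nyquist_hz)]

    # A valid warping function must be strictly increasing on both axes.
    for seq in (xs, ys):
        if any(b <= a for a, b in zip(seq, seq[1:])):
            raise ValueError("anchor frequencies must be strictly increasing")

    def warp(freq_hz):
        # Clamp to the valid band, then interpolate inside the segment.
        f = min(max(float(freq_hz), xs[0]), xs[-1])
        i = min(bisect_right(xs, f) - 1, len(xs) - 2)
        t = (f - xs[i]) / (xs[i + 1] - xs[i])
        return ys[i] + t * (ys[i + 1] - ys[i])

    return warp
```

For example, `make_warping_function([500, 1500, 2500], [600, 1800, 2700], 8000)` returns a function that sends each source formant to its paired target formant and interpolates linearly in between. Pinning the endpoints keeps the warped spectrum inside the original band.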

Similar resources

Probability models of formant parameters for voice conversion

This paper explores the estimation and mapping of probability models of formant parameter vectors for voice conversion. The formant parameter vectors consist of the frequency, bandwidth and intensity of resonance at formants. Formant parameters are derived from the coefficients of a linear prediction (LP) model of speech. The formant distributions are modelled with phoneme-dependent two-dimensio...

Formant-based frequency warping for improving speaker adaptation in HMM TTS

Vocal Tract Length Normalization (VTLN), usually implemented as a frequency warping procedure (e.g. bilinear transformation), has been used successfully to adapt the spectral characteristics to a target speaker in speech recognition. In this study we exploit the same concept of frequency warping but concentrate explicitly on mapping the first four formant frequencies of 5 long vowels from sourc...

Formant model estimation and transformation for voice morphing

In this paper we consider the estimation and mapping of time-varying formant model parameters and orders for voice transformation. The model order is the number of perceptually significant formant trajectories estimated from an analysis of the poles of "over-modelled" linear prediction models of the source and target speech. A 2-D HMM with NF left-to-right states across frequency and M states a...

不需平行語料而基於共振峰與線頻譜頻率映對之語者特質轉換系統 (A Voice Conversion System based on Formant and LSF Mapping without Using Parallel Corpus) [In Chinese]

Voice conversion has been used in many applications. Methods based on vector quantization codebooks and Gaussian mixture models need dynamic time warping on a parallel sentence corpus to generate mapping functions. Recent studies try to use less training data, and even to work without a parallel sentence corpus. This paper presents a voice conversion method that does not use a parallel sentence corpus. It ...

Optimization of Formant Feature Based Speech Recognition

The paper deals with the use of formant features in speech recognition based on dynamic time warping. These features can be easily visualized and give new insight into the reasons for speech recognition errors. The formant feature extraction method, based on singular prediction polynomials, has been applied to the recognition of isolated words. However, the speech recognition perfor...

Publication date: 2006